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Chapter 1 

Introduction to Carbon 
Materials 

The experimental and theoretical study of graphene, two-dimensional (2D) 
graphite, is an extremely rapidly growing field of today's condensed matter 
research. A milestone was the experimental evidence of an unusual quan- 
tum Hall effect reported in September 2005 by two different groups, the 
Manchester group led by Andre Geim and a Columbia-Princeton collabora- 
tion led by Philip Kim and Horst Stormer [1, 2]. Since this moment and 
until March 2008, the beginning of the present series of lectures, almost 700 
manuscripts with "graphene" in their title have been posted on the preprint 
server www.arXiv.org (see Fig. 1.1). 

The reasons for this enormous scientific interest are manyfold, but one 
may highlight some major motivations. First, one may emphasise its possible 
technological potential. One of the first publications on graphene in 2004 by 
the Geim group proved indeed the possibility of an electric field effect in 
graphene, i.e. the possibility to control the carrier density in the graphene 
sheet by simple application of a gate voltage [3]. This effect is a fundamental 
element for the design of electronic devices. Today's silicon-based electronics 
reaches its limits in miniaturisation, which is on the order of 50 nm for an 
electric channel, whereas it has been shown that a narrow graphene strip 
with a width of only a few nanometers may be used as a transistor [4], 
i.e. the basic electronics component. One may therefore hope to improve 
the miniaturisation by one order of magnitude when using graphene-based 
electronics. 

Apart from these promising technological applications, two major moti- 
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Figure 1.1: Number of manuscripts with "graphenc" in the title posted on the preprint 
server. In interpreting these numbers, one must, however, consider that several publica- 
tions on graphenc appeared before 2006, e.g. in the framework of carbon-nanotube or 
graphite research. At this moment, the name "graphene" was not commonly used. 

vations for fundamental research may be emphasised. Graphene is the first 
truely 2D crystal ever observed in nature. This is remarkable because the 
existence of 2D crystals has often been doubted in the past, namely due to 
a theorem (Mermin- Wagner theorem) which states that a 2D crystal looses 
its long-range order, and thus melts, at any small but non-zero temperature, 
due to thermal fluctuations. Furthermore, electrons in graphene show rel- 
ativistic behaviour, and the system is therefore an ideal candidate for the 
test of quantum-field theoretical models which have been developed in high- 
energy physics. Most promenently, electrons in graphene may be viewed as 
massless charged fermions living in 2D space, particles one usually does not 
encounter in our three-dimensional world. Indeed, all massless elementary 
particles happen to be electrically neutral, such as photons or neutrinos. 1 
Graphene is therefore an exciting bridge between condensed-matter and high- 
energy physics, and the research on its electronic properties unites scientists 
with various thematic backgrounds. The discussion of graphene's electronic 
properties and how such relativistic effects are revealed in electric transport 
measurements is naturally a prominent part of the present lecture notes. 

The interest in graphene is not only limited to the scientific community. 
An important number of large-audience articles have recently been published. 
The following list of (more or less serious) citations reveals this broad interest 




1 The neutrino example is only partially correct. It has been shown that neutrinos must 
indeed have a tiny non-zero mass. 
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"Electrons travel through it so fast that their behaviour is gov- 
erned by the theory of relativity rather than classical physics." 

(The Economist, 2006) 

"Inside every pencil, there is a neutron star waiting to get out." 

(New Scientist, 2006) 

"We'll have to rewrite the theory of metals for this problem." 

(Physics Today, 2006) 

Notice that the last citation is from a leading scientist in the field and may 
indeed serve as a guideline for these lectures. 2 

1.1 The Carbon Atom and its Hybridisations 

Carbon, the elementary constituent of graphene and graphite, is the 6th 
element of the periodic table. Its atom is, therefore, built from 6 protons, 
A neutrons, and 6 electrons, where A = 6 and 7 yield the stable isotopes 
12 C and 13 C, respectively, and A = 8 characterises the radioactive isotope 
14 C. The isotope 12 C, with a nuclear spin I = 0, is the most common one 
in nature with 99% of all carbon atoms, whereas only 1% are 13 C with a 
nuclear spin / = 1/2. There are only traces of 14 C (10~ 12 of all carbon 
atoms) which /3-decays into nitrogen 14 N. Although 14 C only occurs rarely, 
it is an important isotope used for historical dating (radiocarbon). Due to 
its half-life of 5 700 years, which corresponds to a reasonable time scale in 
human history, measurement of the 14 C concentration of an organic material, 
mainly wood, allows one to date its biological activity up to a maximum age 
of roughly 80 000 years. In general, carbon is the elementary building block 
of all organic molecules and, therefore, responsible for life on Earth. 

In the atomic ground state, the 6 electrons are in the configuration 
ls 2 2s 2 2p 2 , i.e. 2 electrons fill the inner shell Is, which is close to the nucleus 
and which is irrelevant for chemical reactions, whereas 4 electrons occupy the 
outer shell of 2s and 2p orbitals. Because the 2p orbitals (2p x , 2p y , and 2p 2 ) 
are roughly 4 eV higher than the 2s orbital, it is energetically favourable to 

2 However, our aim is more modest - we do not intend to "rewrite" but rather to "apply" 
the theory of metals to graphene. 
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Figure 1.2: Electronic configurations for carbon in the ground state (left) and in the 
excited state (right). 



put 2 electrons in the 2s orbital and only 2 of them in the 2p orbitals (Fig 
1.2). It turns out, however, that in the presence of other atoms, such as e.g. 
H, O, or other C atoms, it is favourable to excite one electron from the 2s to 
the third 2p orbital, in order to form covalent bonds with the other atoms. 
The gain in energy from the covalent bond is indeed larger than the 4 eV 
invested in the electronic excitation. 

In the excited state, we therefore have four equivalent quantum- mechanical 
states, 1 2s), \2p x ), \2p y ), and \2p z ). A quantum-mechanical superposition of 
the state |2s) with n \2pj) states is called sp n hybridisation, which play an 
essential role in covalent carbon bonds. 



1.1.1 sp hybridisation 

In the sp 1 hybridisation, 3 the |2s) state mixes with one of the 2p orbitals. 
For illustration, we we have chosen the \2p x ) state. A state with equal weight 
from both original states, is obtained by the symmetric and anti-symmetric 
combinations 

\sp+) = ^= (\2s) + \2p x )) , \sp_) = -±= (|2s) - \2p x )) . 

The other states, \2p y ) and \2p z ), remain unaffected by this superposition. 
The electronic density of the hybridised orbitals has the form of a club and 



3 The superscript is often omitted, and one may alternatively use "sp hybridisation". 
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(a) 




I2s> |2p Isp + > 



I2s> -I2p x > Isp > 

(b) 




Figure 1.3: (a) Schematic view of the sp 1 hybridisation. The figure shows on the r.h.s. 
the electronic density of the \2s) and \2p x ) orbitals and on the l.h.s. that of the hybridised 
ones, (b) Acetylene molecule (H-C=C-H). The propeller- like 2p y and 2p z orbitals of the 
two C atoms strengthen the covalent o bond by forming two 7r bonds (not shown). 



6 



Introduction to Carbon Materials 




Figure 1.4: (a) Schematic view of the sp 2 hybridisation. The orbitals form angles of 
120°. (b) Benzene molecule (CeH 6 ). The 6 carbon atoms are situated at the corners of 
a hexagon and form covalent bonds with the H atoms. In addition to the 6 covalent a 
bonds between the C atoms, there are three tt bonds indicated by the doubled line, (c) 
The quantum-mechanical ground state of the benzene ring is a superposition of the two 
configurations which differ by the position of the tt bonds. The n electrons are, thus, 
delocalised over the ring, (d) Graphcne may be viewed as a tiling of benzene hexagons, 
where the H atoms are replaced by C atoms of neighbouring hexagons and where the n 
electrons are delocalised over the whole structure. 



is elongated in the +x (—x) direction for the \sp + ) (\sp-)) states [Fig. 1.3 
(a)]. This hybridisation plays a role, e.g., in the formation of the acetylene 
molecule H-C=C-H, where overlapping sp 1 orbitals of the two carbon atoms 
form a strong covalent a bond [Fig. 1.3 (b)]. The remaining unhybridised 2p 
orbitals are furthermore involved in the formation of two additional it bonds, 
which are weaker than the a bond. 



1.1.2 sp 2 hybridisation — graphitic allotopes 

In the case of a superposition of the 2s and two 2p orbitals, which we may 
choose to be the \2p x ) and the \2p y ) states, one obtains the planar sp 2 hy- 
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bridisation. The three quantum-mechanical states are given by 
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These orbitals are oriented in the xy-plane and have mutual 120° angles 
[Fig. 1.4 (a)]. The remaining unhybridised 2p z orbital is perpendicular to 
the plane. 

A prominent chemical example for such hybridisation is the benzene 
molecule the chemical structure of which has been analysed by August Kekule 
in 1865 [5]. The molecule consists of a hexagon with carbon atoms at the 
corners linked by o bonds [Fig. 1.4 (b)]. Each carbon atom has, furthermore, 
a covalent bond with one of the hydrogen atoms which stick out from the 
hexagon in a star-like manner. In addition to the 6 o bonds, the remaining 
2p z orbitals form 3 n bonds, and the resulting double bonds alternate with 
single a bonds around the hexagon. Because a double bond is stronger than 
a single o bond, one may expect that the hexagon is not perfect. A double 
bond (C=C) yields indeed a carbon-carbon distance of 0.135 nm, whereas it 
is 0.147 nm for a single a bond (C-C). However, the measured carbon-carbon 
distance in benzene is 0.142 nm for all bonds, which is roughly the average 
length of a single and a double bond. This equivalence of all bonds in ben- 
zene was explained by Linus Pauling in 1931 within a quantum-mechanical 
treatment of the benzene ring [6]. The ground state is indeed a quantum- 
mechanical superposition of the two possible configurations for the double 
bonds, as shown schematically in Fig. 1.4 (c). 

These chemical considerations indicate the way towards carbon-based 
condensed matter physics - any graphitic compound has indeed a sheet of 
graphene as its basic constituent. Such a graphene sheet may be viewed sim- 
ply as a tiling of benzene hexagons, where one has replaced the hydrogen by 
carbon atoms to form a neighbouring carbon hexagon [Fig. 1.4 (d)]. How- 
ever, graphene has remained the basic constituent of graphitic systems during 
a long time only on the theoretical level. From an experimental point of view, 
graphene is the youngest allotope and accessible to physical measurements 
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Figure 1.5: Graphitic allotopes (a) Piece of natural graphite, (b) Layered structure of 
graphite (stacking of graphene layers), (c) OD allotope: Ceo molecule, (d) ID allotope: 
single- wall carbon nanotube. (e) Optical image of a carbon nanotube. 

only since 2004. 

Historically, the longest known allotope is 3D graphite [Fig. 1.5 (a)]. 
Graphite was discovered in a mine near Borrowdale in Cumbria, England in 
the 16th century, and its use for marking and graphical purposes was almost 
immediately noticed. Indeed, the nearby farmers used graphite blocks from 
the mine for marking their sheep. Due to its softness and dark color, graphite 
was considered during a long time as some particular type of lead. The full 
name "lead pencil" still witnesses this historical error. 4 That graphite was 
formed from carbon atoms was discovered by the Swedish- German pharma- 
cist Carl Wilhelm Scheele in the middle of the 18th century. But it was the 
German chemist Abraham Gottlob Werner in 1789 who coined the material 
by its current name "graphite" , thus emphasising its main use for graphical 
purposes. 5 

Graphite may be viewed as a stacking of graphene sheets [Fig. 1.5 (b)] 
that stick together due to the van der Waals interaction, which is much 

4 The graphitic core of a pencil is still called "lead" , and the German name for "pencil" 
is "Bleistift" , "Blei" being the German name for lead. 

5 The term is derived from the Greek word jpatpeLis ("graphcin", to draw, to write). 
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weaker than the inplane covalent bonds. This physical property explains the 
graphic utility of the material: when one writes with a piece of graphite, 
i.e. when it is scratched over a sufficiently rough surface, such as a piece of 
paper, thin stacks of graphene sheets are exfoliated from bulk graphite and 
now stick to the surface. This is possible due to the above-mentioned weak 
van der Waals interaction between the graphene sheets. 

The OD graphitic allotope (fullerenes) has been discovered in 1985 by 
Robert Curl, Harold Kroto, and Richard Smalley [7]. Its most prominent 
representative is the C 60 molecule which has the form of a football and is also 
called "buckyball". It consists of a graphene sheet, where some hexagons are 
replaced by pentagons, which cause a crumbling of the sheet and the final 
formation of a graphene sphere [Fig. 1.5 (c)]. Its existance had been predicted 
before, in 1970, by the Japanese theoretician Eiji Ozawa [8]. 

Carbon nanotubes, the ID allotope, may be viewed as graphene sheets 
which are rolled up [Fig. 1.5 (d) and (e)], with a diameter of several nano- 
meters. One distinguishes single-wall from multi-wall nanotubes, according 
to the number of rolled up graphene sheets. The discovery of carbon nan- 
otubes is most often attributed to Sumio Iijima and his 1991 publication in 
Nature [9]. Recently, doubts about this attribution have been evoked because 
it seems that carbon nanotubes had a longer history in the community of ma- 
terial scientists [10]. Indeed, a publication by the Soviet scientists Radushke- 
vich and Lukyanovich in 1952 contained a transmission electron microscope 
image showing carbon nanotubes [11]. It is, however, the merit of the 1991 
Iijima paper to have attracted the interest of the condensed matter physics 
community to carbon nanotubes and to have initiated an intense research on 
this compound, also in the prospect of nanotechnological applications. 

1.1.3 sp 3 hybridisation — diamonds 

If one superposes the 2s and all three 2p orbitals, one otains the sp 3 hybridi- 
sation, which consists of four club-like orbitals that mark a tetrahedron. The 
orbitals form angles of 109.5° degrees [Fig. 1.6 (a)]. A chemical example 
for this hybridisation is methane (CH4), where the four hybridised orbitals 
are used to form covalent bonds with the Is hydrogen atoms. In condensed 
matter physics, the 2p 3 hybridisation is at the origin of the formation of di- 
amonds, when liquid carbon condenses under high pressure. The diamond 
lattice consists of two interpenetrating face-center-cubic (fee) lattices, with 
a lattice spacing of 0.357 nm, as shown in Fig. 1.6 (b). 
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Figure 1.6: (a) sp 3 hybridisation with 109.5° angle between the four orbitals. (b) Crystal 
structure of diamond (two interpenetrating fee lattices). 

Although they consist of the same atomic ingredient, namely carbon, 
the 3D graphite and diamond crystals are physically extremely different. 
Graphite, as described above, is a very soft material due to its layered struc- 
ture, whereas diamond is one of the hardest natural materials because all 
bonds are covalent a bonds. The fact that all 4 valence electrons in the outer 
atomic shell are used in the formation of the a bondes is also the reason for 
diamond being an insulator with a large band gap of 5.47 eV. In contrast 
to insulating diamond, the electrons in the weaker it bonds in graphite are 
delocalised and, thus, yield good electronic conduction properties. 

1.2 Crystal Structure of Graphene and Graphite 
1.2.1 Graphene's honeycomb lattice 

As already mentioned in the last section, the carbon atoms in graphene con- 
dense in a honeycomb lattice due to their sp 2 hybridisation. The honeycomb 
lattice is not a Bravais lattice because two neighbouring sites are not equiva- 
lent. Fig. 1.7 (a) illustrates indeed that a site on the A sublattice has nearest 
neighbours (nn) in the directions north-east, north-west, and south, whereas 
a site on the B sublattice has nns in the directions north, south-west, and 
south-east. Both A and B sublattices, however, are triangular 6 Bravais lat- 
tices, and one may view the honeycomb lattice as a triangular Bravais lattice 
with a two-atom basis (A and B). The distance between nn carbon atoms 
is 0.142 nm, which is the average of the single (C-C) and double (C=C) 

6 The triangular lattice is sometimes also called hexagonal lattice. 
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Figure 1.7: (a) Honeycomb lattice. The vectors Si, 62, and S3 connect nn carbon atoms, 
separated by a distance a = 0.142 nm. The vectors ai and a 2 are basis vectors of the tri- 
angular Bravais lattice, (b) Reciprocal lattice of the triangular lattice. Its primitive lattice 
vectors are a* and a|. The shaded region represents the first Brillouin zone (BZ), with 
its centre T and the two inequivalent corners K (black squares) and K' (white squares). 
The thick part of the border of the first BZ represents those points which are counted in 
the definition such that no points are doubly counted. The first BZ, defined in a strict 
manner, is, thus, the shaded region plus the thick part of the border. For completeness, 
we have also shown the three inequivalent cristallographic points M, M', and M" (white 
triangles). 



covalent a bonds, as in the case of benzene. 

The three vectors which connect a site on the A sublattice with a nn on 
the B sublattice are given by 

Sl = \ (y^ ex + e y) ' 82 = ^ (-V3e x + e y ^J , 6 3 = -ae y , (1.2) 
and the triangular Bravais lattice is spanned by the basis vectors 

ai = V3ae x and a 2 = ^~ {e x + V^e^j . (1.3) 

The modulus of the basis vectors yields the lattice spacing, a = v^a = 0.24 
nm, and the area of the unit cell is A uc = y/3d 2 /2 = 0.051 nm 2 . The density 
of carbon atoms is, therefore, nc = 2/A uc = 39 nm~ 2 = 3.9 x 10 15 cm -2 . 
Because there is one 7r electron per carbon atom that is not involved in a 
covalent o bond, there are as many valence electrons than carbon atoms, and 
their density is, thus, n n = n c = 3.9 x 10 15 cm~ 2 . As is discussed in the 
following chapter, this density is not equal to the carrier density in graphene, 
which one measures in electrical transport measurements. 
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The reciprocal lattice, which is defined with respect to the triangular 
Bravais lattice, is depicted in Fig. 1.7 (b). It spanned by the vectors 

2tt / e y \ , * 47r 

a, = —=- e x — and a 9 = — e v . 1.4 

Physically, all sites of the reciprocal lattice represent equivalent wave vectors. 
Any wave - be it a vibrational lattice excitation or a quantum-mechanical 
electronic wave packet - propagating on the lattice with a wave vector differ- 
ing by a reciprocal lattice vector has indeed the same phase up to a multiple 
of 2-7T, due to the relation 

• si} = 2n6ij (1.5) 

(for i,j = 1,2) between direct and reciprocal lattice vectors. The first Bril- 
louin zone [BZ, shaded region and thick part of the border of the hexagon in 
Fig. 1.7 (b)] represents a set of inequivalent points in the reciprocal space, 
i.e. of points which may not be connected to one another by a reciprocal 
lattice vector, or else of physically distinguishable lattice excitations. The 
long wavelength excitations are situated in the vicinity of the T point, in 
centre of the first BZ. Furthermore, one distinguishes the six corners of the 
first BZ, which consist of the inequivalent points K and K' represented by 
the vectors 

±K = ±^=~e x . (1.6) 

The four remaining corners [shown in gray in Fig. 1.7 (b)] may indeed be con- 
nected to one of these points via a translation by a reciprocal lattice vector. 
These cristallographic points play an essential role in the electronic proper- 
ties of graphene because their low-energy excitations are centered around the 
two points K and K', as is discussed in detail in the following chapter. We 
emphasise, because of some confusion in the litterature on this point, that 
the inequivalence of the two BZ corners, K and K', has nothing to do with 
the presence of two sublattices, A and B, in the honeycomb lattice. The 
form of the BZ is an intrinsic property of the Bravais lattice, independent 
of the possible presence of more than one atom in the unit cell. For com- 
pleteness, we have also shown, in Fig. 1.7 (b), the three crystallographically 
inequivalent M points in the middle of the BZ edges. 
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Figure 1.8: Two possibilities for stacking two graphene layers. The symbols correspond 
to the following lattice sites: black circles are A sites in the lower layer, black triangles 
B sites in the lower layer, white circles A' sites in the upper layer, and white triangles B' 
sites in the upper layer, (a) The upper layer is translated by Si with respect to the lower 
one; the A' sites are on top of the B sites, (b) The upper layer is translated by —Si with 
respect to the lower one; the B' sites are on top of the A sites. 

1.2.2 Graphene stacking — the different forms of graphite 

Graphite consists of stacked graphene layers. One distinguishes crystal (or- 
dered) graphite, with two different basic stacking orders, from turbostratic 
graphite with a certain amount of disorder in the stacking. 

To illustrate the ordering of graphene layers in crystal graphite, we first 
consider only two graphene layers (bilayer graphene). The distance between 
layers is roughly d = 2.4a = 0.34 nm, and the stacking is such that there are 
atoms in the upper layer placed at the hexagon centres of the lower layer. 
The layers are, thus, translated with respect to each other, and one may 
distinguish two different patterns, as shown in Fig. 1.8. The displacement 
is given by either Si or —Si, where one may choose any of the nn vectors 
(1.2) with % — 1, 2, or 3. For the choice Si, one obtains a configuration where 
the A' atoms of the upper layer are on top of the B atoms of the lower one 
[Fig. 1.8 (a)], and for a translation of — Si, the B' atoms in the upper layer 
are on top of A atoms in the lower one [Fig. 1.8 (b)]. Notice that the two 
configurations, both of which contain now four atoms (A, B, A', and B') per 
unit cell, are equivalent if one has a reflexion symmetry in the perpendicular 
axis, z — > —z. 

Although physically not relevant in the bilayer case, this difference in the 
stackingturns out to be important when considering crystal graphite with an 
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infinity of stacked layers. Let us consider the situation where the second layer 
is translated with respect to the first by <5j. The third layer may be chosen 
to be translated with respect to the second either by — <5j, in which case its 
sites coincide with those of the first one, or again by Si, where the atomic 
configuration of the first layer is not recovered. In the first case, one therefore 
obtains an ABA stacking, whereas the stacking is ABC in the second case. 
In general, one distingushes two types of ordering: 

1. All layers are translated with respect to their lower neighbour by Si. 
One obtains a rhombohedral stacking, which is also called ABC stack- 
ing because one needs three layers to recover the atomic configuration 
of the original layer. There are, thus, 6 atoms per unit cell which has 
an extension of 3d in the z-direction (/3-graphite) . 

2. If the sign of the translation alternates, i.e. Si, — S iy S iy — 6i, when 
stacking the graphene layers, one obtains a hexagonal (or else AB or 
Bernal) stacking. Here, one has 4 atoms per unit cell which has a height 
of 2d in the z-direction (a-graphite). 

In principle, it is possible to have some randomness in the stacking, i.e. 
ABC parts may randomly substitute AB parts when considering a-graphite. 
However, a crystalline AB stacking occurs most often in nature, with about 
30% of ABC-stacked graphite. 

In the case of turbostratic graphite, one may distinguish translational dis- 
order from rotational disorder in the stacking. Generally, the graphene layers 
are much less bound together in turbostratic than in crystalline graphite, such 
that turbostratic graphite is a better lubrificant. 

As an example of rotational disorder, we consider two graphene layers 
which are rotated by an angle 

ai ■ a' x , 
l d i|l d il 

with respect to each other, where a.[ denotes the lattice vector in the upper 
graphene layer corresponding to ai in the lower one. If the angle fulfils certain 
comensurability conditions, one obtains a so-called Moire pattern, with a 
larger unit cell as depictecd in Fig. 1.9. The Moire pattern reproduces a 
(larger) honeycomb lattice. 
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Figure 1.9: Moire pattern obtained by stacking two honeycomb lattices (gray) with a 
relative (chiral) angle 0. One obtains a hexagonal superstructure indicated by the black 
hexagons. 



16 



Introduction to Carbon Materials 



Figure 1.10: (a) AFM image of a grapheme flake on a SiC>2 substrate, (b) Transmission- 
electron-microscopic image of free-hanging graphene. Both pictures have been taken by 
the Manchester group. 

1.3 Fabrication of Graphene 

We briefly review the two techniques used in fabricating graphene, mechan- 
ical exfoliation of graphene from bulk graphite and graphitisation of epitax- 
ially grown SiC crystals. The resulting graphene probes have similar, but 
not always equal, physical properties. It is therefore important to describe 
both systems separately and to clearly indicate on what type of graphene, 
exfoliated or epitaxial, a particular measurement is performed. A systematic 
comparison of both graphene types is still an open research field. 

1.3.1 Exfoliated graphene 

The mechanical exfoliation technique, elaborated mainly by the Manchester 
group, consistsy of peeling a layered material [13]. In the case of graphite, it 
uses its primary graphical capacity, which we have already alluded to above: 
if one scratches a piece of graphite on a substrate, thin graphite stacks are 
exfoliated from the bulk and left behind on the substrate. Most of these thin 
stacks comprise several (tens or hundreds) of graphene sheets, but a few of 
them only consist of a single graphene layer. The fabrication of graphene is, 
thus, extremely simple, and we produce single layers of graphene whenever 
we write with a pencil of sufficiently high graphitic quality. 7 

In practice, one does not use a pencil to fabricate graphene layers, but one 
prepares very thin samples by peeling a small graphite crystallite in a piece 




7 Not all pencils are of the required quality. The graphite lead of most pencils contains 
normally a certain amount of clay impurities. 
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of folded scotchtape. Each time one peels away the tape, the graphite flake 
is cleaved into two parts and thus becomes thinner. After several cycles, 
the scotchtape with the graphene sheets stuck to it is glued to the SiC>2 
substrate, prepared by a mix of hydrochloric acid and hydrogen peroxide 
to accept better the graphene sheets from the scothtape. When the tape is 
carefully peeled away, the graphene sheets remain glued to the substrate. 

The main problem consists of the identification of the few single graphene 
layers, which are randomly distributed on the substrate. A definitve identi- 
fication may be achieved by atomic- force microscopy (AFM), the scanning 
capacity of which is unfortunately limited to a very small surface during a 
reasonable time. One therefore needs to have a hint of where to search for 
mono-layer graphene. This hint comes from a first (optical) glance at the 
substrate. The 300 nm thick SiC>2 substrate, which was originally used by 
the Manchester group and which is now most common, turns out to yield an 
optimal contrast such that one may, by optical meanse, identify mono-layer 
graphene sheets with a high probability. AFM imaging is then used to con- 
firm this first glance. An example of such AFM image is shown in Fig. 1.10 
(a). Notice that it has been shown that one may also depose an exfoliated 
graphene sheet on a scaffold such that it is free-hanging over a rather large 
surface of several fim 2 [Fig. 1.10 (b)]. 

The exfoliation technique is not limited to graphite, but may also be 
used in the fabrication of ultra-thin samples from other layered crystals. 
The Manchester group has indeed shown that one may, by this technique, 
fabricate e.g. single layer BN, NbSe2, Bi 2 Sr 2 CaCu20 x , and M0S2 crystallites 
of a typical size of several /im 2 [13]. The exfoliation technique is, therefore, 
extremely promising in the study of truly 2D crystals. 

One may, furthermore, control the carrier density in the metallic 2D crys- 
tals, such as graphene, by the electric field effect; the 300 nm thick insulating 
Si02 layer is indeed on top of a positively doped metallic Si substrate, which 
serves as a backgate. The combined system graphene-Si02-backgate may, 
thus, be viewed as a capacitor (see Fig. 1.11) the capacity of which is 



C — — 



Q e eA 



(1.7) 



Vg d 



where Q = en 2 nA is the capacitor charge, in terms of the total surface A, 
V g is the gate voltage, d = 300 nm is the thickness of the Si02 layer with 
the dielectric constant e = 3.7. The field-effect induced 2D carrier density is 
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Figure 1.11: Schematic view of graphene on a SiC-2 substrate with a doped Si (metallic) 
backgate. The system graphene-SiCVbackgate may be viewed as a capacitor the charge 
density of which is controled by a gate voltage V g . 

thus given by 

n 2D = ctV Q with a = ^ - 7.2 x 10 10 ^— . (1.8) 

ed V 

The gate voltage may vary roughly between —100 and 100 V, such that one 
may induce maximal carrier densities on the order of 10 12 cm~ 2 , on top of 
the intrinsic carrier density which turns out to be zero in graphene, as will be 
discussed in the next chapter. At gate voltages above ±100 V, the capacitor 
breaks down (electrical breakdown). 

1.3.2 Epitaxial graphene 

An alternative method to fabricate graphene has been developed by the At- 
lanta group, led by Walt de Heer and Claire Berger [14]. It consists of expos- 
ing an epitaxially grown hexagonal (4H or 6H-) SiC crystal 8 to temperatures 
of about 1300° C in order to evaporate the less tightly bound Si atoms from 
the surface. The remaining carbon atoms on the surface form a graphitic 
layer (graphitsation). The physical properties of this graphitic layer depend 
on the chosen SiC surface. In the case of the Si-terminated (0001) surface, 
the graphitisation process is slow, and one may thus control the number of 
formed graphene layers (usually one or two). The resulting electron mobility, 

8 SiC exists also in several other crystal structures, such as the zinc blende structure 
(3C-SiC), i.e. a diamond structure with different atom types on the two distinct fee 
sublattices. 
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(a) (b) 




Figure 1.12: Epitaxial Graphene. (a) Schematic view on epitaxial graphene. On top 
of the SiC substrate, a graphitic layer. It consists of several one-atom thick caron layers. 
The first one is a buffer layer, which is tightly bound to the substrate, at a distance of 0.2 
(0.17) nm for a Si- (C-)terminated surface. The graphene layers are formed on top of this 
buffer layer at a distance of 0.38 (0.39) nm, and they are equally spaced by 0.39 nm. (b) 
AFM image of epitaxial graphene on C-terminated SiC substrate. The steps those of the 
SiC substrate. The 5-10 graphene layers lie on the substrate similar to a carpet which has 
folds visible as white lines on the image. 



however, turns out to be rather low, such that the Si-terminated surface is 
less chosen for the fabrication of samples used in transport measurements. 
For a C-terminated (0001) surface, the graphitisation process is very fast, 
and a large number of graphene layers are formed (up to 100). In contrast 
to epitaxial graphene on the Si-terminated surface, the electron mobility is, 
here, rather high. 

In contrast to exfoliated graphene, the SiC substrate must be consid- 
ered as an integral part of the whole system of epitaxial graphene. It is 
indeed the mother compound, and the first graphitic layer formed during the 
graphitisation process is tightly bound to the SiC substrate (Fig. 1.12). The 
distance between this layer and the substrate has been estimated numerically 
to be 0.20 nm for a Si-terminated surface and 0.17 nm for a C-terminated 
surface [15]. These distances are much smaller than the distance between 
graphene sheets in crystalline graphite (2.4a = 0.34 nm). 0.38 (0.39) nm 
above this buffer layer for Si- (C-)terminated SiC, several graphene layers 
are formed, which are separated by a distance of 0.39 nm. These character- 
istic distances are roughly 20% larger than the layer distances in crystalline 
graphite, and one may therefore expect them to be less tightly bound. The 
above-mentioned numerical values for the distances have been confirmed by 
X-ray measurements [15]. 
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Furthermore, X-ray diffraction measurements have revealed a certain 
amount of rotational disorder in the stacking of the graphene layers [16]. 
This, together with the rather large spacing between the graphene sheets on 
top of the buffer layer, corroborates the view that the graphitic layer on the 
SiC substrate consists indeed of almost independent graphene layers rather 
than of a thin graphite flake. However, the charge is not homogeneously 
distributed between the different graphene layers - it turns out that the 
graphene layer, 9 which is closest to the substrate, is electron- doped due to a 
charge transfer from the bulk SiC, whereas the subsequent layers are hardly 
charged. As will be discussed in the following chapters, this inhomogeneous 
charge distribution in epitaxial graphene may be at the origin of different 
results in transport measurements when compared with exfoliated graphene 
and theoretical predictions. 



9 We refer to the layers on top of the buffer layer when using the term "graphene" . The 
buffer layer may not be counted due to its tight bonding to the SiC substrate. 



Chapter 2 



Electronic Band Structure of 
Graphene 



As we have discussed in the introduction, three electrons per carbon atom in 
graphene are involved in the formation of strong covalent a bonds, and one 
electron per atom yields the ir bonds. The ir electrons happen to be those re- 
sponsible for the electronic properties at low energies, whereas the a electrons 
form energy bands far away from the Fermi energy. This chapter is, thus, 
devoted to a discussion of the energy bands of it electrons within the tight- 
binding approximation, which was originally calculated for the honeycomb 
lattice by P. R. Wallace in 1947 [17]. We do not consider the a electrons, 
here, and refer the interested reader to the book by Saito, Dresselhaus, and 
Dresselhaus for a tight-binding calculation of the energy bands formed by 
the a electrons [12], which are far away from the Fermi level. 

The chapter consists of two sections. The first one is devoted to the cal- 
culation of the n energy bands in graphene, where one is confronted with 
the problem of two atoms per unit cell. After a brief discussion of Bloch's 
theorem, and a formal solution of the tight-binding model, we calculate the 
energy dispersion of n electrons in graphene, taking into account nearest- 
neighbour (nn) and next-nearest-neighbour (nnn) hopping and nn overlap 
corrections. The second section consists of the continuum limit, which de- 
scribes the low-energy properties of electrons in graphene. 
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2.1 Tight-Binding Model for Electrons on the 
Honeycomb Lattice 

The general idea of the tight-binding model is to write down a trial wave- 
function constructed from the orbital wavef unctions, <p^(r — R 3 -), of the 
atoms forming a particular lattice described by the (Bravais) lattice vectors 
Rj = mja.i+rijSi2, where rrij and rij are integers. 1 In addition, the trial wave- 
function must reflect the symmetry of the underlying lattice, i.e. it must be 
invariant under a translation by any arbitrary lattice vector Rj. We con- 
sider, for simplicity in a first step, the case of a Bravais lattice with one atom 
per unit cell and one electron per atom. The Hamiltonian for an arbitrary 
electron, labelled by the integer /, is given by 

Ifi—^Aj + fVfo-R,-), (2.1) 

i=i 

where A; = Vf is the 2D Laplacian operator, in terms of the 2D gradient 
V; = d/dxi + d/dyi with respect to the electron's position r; = (xi,yi), and 
m is the electron mass. Each ion on site Rj yields an electrostatic potential 
felt by the electron, and its overall potential energy J2j V( r i ~ Rj)> where N 
is the number of lattice sites, is, therefore, a periodic function with respect 
to an arbitrary translation by a lattice vector Rj in the thermodynamic limit 
N — > oo. The total Hamiltonian is the sum over all electrons, 

N 

H = Y,H h (2.2) 
i 

if we suppose one electron per lattice site, as mentioned above. 

The tight-binding approach is based on the assumption that the electron I 
is originally bound to a particular ion at the lattice site R«, i.e. it is described 
to great accuracy by a bound state of the (atomic) Hamiltonian 

whereas the contributions to the potential energy AV = J2f^i ^( r z — Rj) from 
the other ions at the sites Rj, j ^ I, may be treated perturbatively. The 

1 Because we are interested in 2D lattices, we limit the discussion to two dimensions, 
the generalisation to arbitrary dimensions being straight-forward. 
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bound state of Hf is described by the above-mentioned atomic wavefunction 
0( a )(r-Rj). 

2.1.1 Bloch's theorem 

Another ingredient, apart from the atomic wavefunction, of the trial wave- 
function is a symmetry consideration - the trial wavefunction must respect 
the discrete translation symmetry of the lattice. This is the essence of Bloch's 
theorem. In quantum mechanics, a translation by a lattice vector Rj may be 
described by the operator 

T Ri = e*P' R % (2.3) 

in terms of the momentum operator p, which needs to be appropriately 
defined for the crystal and which we may call quasi-momentum operator, 
as will become explicit below. The symmetry operator, because it describes 
a symmetry operation under which the physical problem is left invariant, 
commutes with the full Hamiltonian (2.2), [7^.,H] = 0. The eigenstates of 
H are, therefore, necessarily also eigenstates of Tr^, for any lattice vector Rj, 
and the momentum p, which is the eigenvalue of the momentum operator 
p, is a good quantum number. Because of the relation (1.5) between the 
basis vectors of the direct and the reciprocal lattices, this momentum is 
only defined modulo a reciprocal lattice vector Gj = m*a* + n*^, where 
m* and n* are arbitrary integers. Indeed, if we had chosen the momentum 
operator p' = p + HGj instead of p in the definition (2.3) of the discrete 
translation operator, we would have simply multiplied it with a factor of 
exp(iGj ■ Rj) = exp(i27m) = 1 because of the integer value n = m*mi + n*rii. 
We, thus, need to identify, as pointed out in the last chapter, identify all 
momenta which differ by a reciprocal lattice vector, and it is more convenient 
to speak of a quasi-momentum p = hk, which is restricted to the first BZ. 

The trial wavefunction, constructed from the atomic orbital wavefunc- 
tions (a) (r -Rj), 

^k(r) = ^e lk - R ^W(r-R,) (2.4) 

fulfils the above-mentioned requirements, i.e. it is an eigenstate of the trans- 
lation operator (2. 3). 2 That the wavefunction (2.4) is indeed an eigenstate 

2 Thc sum X)r i s a short notation for X)m n an< ^ runs over au lattice vectors 
Rj = m^-ai + rij&2, in the thermodynamic limit of an infinite lattice. 
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of 7^ may be seen from 



= ^ k (r + R,) 
= ^V k ' R ^ (a) [r 




Ri)] 




00 



a 





^k(r), 



where the first line reflects the translation by the lattice vector in the 
argument of the wavef unction, and we have resummed the lattice vectors in 
the last line with the redefinition R m = R^ — Rj. 

2.1.2 Lattice with several atoms per unit cell 

If we have several atoms per unit cell, such as in the case of the honeycomb 
lattice, the reasoning described above must be modified. Notice first that a 
translation by any vector Sj that relates a site on one sublattice to that on a 
second sublattice is not a symmetry operation, i.e. [%.,H] ^ 0, if we define 
T$. = exp(ip • Sj/h) in the same manner as the translation operator (2.3). 
One must, therefore, treat the different sublattices apart. 

In the case of two atoms per unit cell, we may write down the trial 
wavefunction as 



where Ok and 6k are complex functions of the quasi-momentum k. Both 
ijj^ (r) and ip^ (r) are Bloch functions with 



where j = A/B labels the atoms on the two sublattices A and B, and Sj 
is the vector which connects the sites of the underlying Bravais lattice with 
the site of the j atom within the unit cell. Typically one chooses the sites 
of one of the sublattices, e.g. the A sublattice, to coincide with the sites of 
the Bravais lattice. Notice furthermore that there is some arbitrariness in 
the choice of the phase in Eq. (2.6) - instead of choosing exp(ik • R z ), one 
may also have chosen exp[ik ■ (R; — Sj), as for the arguments of the atomic 
wavef unctions. The choice, however, does not affect the physical properties 



^k(r) = ak^k ( r ) + Mi ( r ) 



(2.5) 




(2.6) 
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of the system because it simply leads to a redefinition of the weights a k and 
6k which aquire a different relative phase [18]. 

With the help of these wavefunctions, we may now search the solutions 
of the Schrodinger equation 

Hip* = ekVv 

Here, we have chosen an arbitrary representation, which is not necessarily 
that in real space. 3 Multiplication of the Schrodinger equation by ip^ from 
the left yields the equation ip^Hip^ = e^i/j^k, which may be rewritten in 
matrix form with the help of Eqs. (2.5) and (2.6) 

(albi)n k ^^e k (albi)S k ^y (2.7) 

Here, the Hamiltonian matrix is defined as 

_ / ti*>HiP 4{ A »H4,<f> \ _ , 

and the overlap matrix 

accounts for the non-orthogonality of the trial wavefunctions. The eigen- 
values ek of the Schrodinger equation are the energy dispersions or energy 
bands, and they may be obtained from the secular equation 

det [H k - e£<S k ] = 0, (2.10) 

which needs to be satisfied for a non-zero solution of the wavefunctions, i.e. 
for a k 7^ and 6 k ^ 0. The label A denotes the energy bands, and it is clear 
that there are as many energy bands as solutions of the secular equation 
(2.10), i.e. two bands for the case of two atoms per unit cell. Notice that the 
generalisation to n atoms per unit cell is straight-forwards - in this case, the 
wavefunction (2.5) needs to be written as 



k ' 



3 The wavefunction Vk(r) is, thus, the real space representation of the Hilbcrt vector 



26 



Electronic Band Structure of Graphene 



where the superscript j denotes the different atoms per unit cell. The secular 
equation (2.10) remains valid, in terms of the Hermitian n x n matrices 

?% = l£>Hl4p and SS = l£>lg\ (2.11) 

and it is now an equation of degree n. This means that there are n energy 
bands, i.e. as many energy bands as atoms per unit cell. 

In a first step, one often neglects the overlap corrections, i.e. one assumes 
a quasi-orthogonality of the wavef unctions, S£ = NSy. It turns out, however, 
to keep track of these overlap corrections in the case of graphene. As is 
discussed in the following section, they yield a contribution which is on the 
same order of magnitude as the nnn hopping corrections. 

Formal solution 

Before turning to the specific case of graphene and its energy bands, we solve 
formally the secular equation for an arbitrary lattice with several atoms per 
unit cell. The Hamiltonian matrix (2.11) may be written, with the help of 
Eq. (2.6), as 

n v = ^ e ik - (R '- Rm) J d 2 r 0W*(r + ^ - R k )H^\r + 5 j - 
= N e ' k ' R ' / d2r W *( r ) \H a + AV \ O °(r + <^ - Rj) 

= n^4 + 4) 

where Sij = Sj — Si, 

4 = ^Te ikR < [ rfW«'(r + 6 t - R*)0 ( % + 8 j - R m ) = ^ (2.13) 
R; J 

and we have defined the hopping matrix 

4 = e lk Rl [ d 2 r 0«*(r + S t - R k )AV^\r + 5 j - R m ) . (2.14) 
Rj J 

The last line in Eq. (2.12) has been obtained from the fact that the atomic 
wavefunctions 0^(r) are eigenstates of the atomic Hamiltonian H a with the 



Rm) 



(2.12) 
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atomic energy for an orbital of type i. This atomic energy plays the role 
of an onsite energy. The secular equation now reads 

det [*£"-(<£- e<*>)] =0. (2.15) 

Notice that, if the the atoms on the different sublattices are all of the same 
electronic configuration, one has = eo for all i, and one may omit this 
onsite energy, which yields only a constant physically irrelevant shift of the 
energy bands. 



2.1.3 Solution for graphene with nearest-neighbour and 
next-nearest-neighour hopping 

After these formal considerations, we now study the particular case of the 
tight-binding model on the honeycomb lattice, which yields, to great accu- 
racy, the n energy bands of graphene. Because all atomic orbitals are p z 
orbitals of carbon atoms, we may omit the onsite energy eo, as discussed 
in the last paragraph. We choose the Bravais lattice vectors to be those of 
the A sublattice, i.e. 8a = 0, and the equivalent site on the B sublattice is 
obtained by the displacement 6b = Sab — <^3 (see Fig. 2.1). The hopping 
amplitude between nn is given by the expression 

t = [ d 2 r(f) A *(r)AV(j) B (r + d 3 ), (2.16) 
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and we also take into account nnn hopping which connects nn sites on the 
same sublattice 

t nnn = J d 2 r0 A *(r)AV> A (r + ai ) = J d 2 r B *(r)AV» + &1 ). (2.17) 

Notice that one may have chosen any other vector Sj or a 2 , respectively, in 
the calculation of the hopping amplitudes. Because of the normalisation of 
the atomic wavef unctions, we have J d 2 rcj)^* (r)<f>^ (r) = 1, and we consider 
furthermore the overlap correction between orbitals on nn sites, 

s = J d 2 r<f> A *{r)<f) B {r + 5 3 ). (2.18) 

We neglect overlap corrections between all other orbitals which are not nn, 
as well as hopping amplitudes for larger distances than nnn. 

If we now consider an arbitrary site A on the A sublattice (Fig. 2.1), we 
may see that the off-diagonal terms of the hopping matrix (2.14) consist of 
three terms corresponding to the nn B 1: B 2 , and B 3 , all of which have the 
same hopping amplitude t. However, only the site B 3 is described by the 
same lattice vector (shifted by S3) as the site A and thus yields a zero phase 
to the hopping matrix. The sites B\ and B 2 correspond to lattice vectors 
shifted by ai and a 3 = a 2 — ai, respectively. Therefore, they contribute 
a phase factor exp(ik • a x ) and exp(?k ■ a 3 ), respectively. The off-diagonal 
elements of the hopping matrix may therefore be written as 4 

as well as those of the overlap matrix 

( s k A = s k B = 1; due to the above-mentioned normalisation of the atomic 
wavefunctions), where we have defined the sum of the nn phase factors 

7k = 1 +e ik ' ai +e ik - as . (2.19) 



4 Thc hopping matrix element t£ B corresponds to a hopping from the B to the A 
sublattice. 
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The nnn hopping amplitudes yield the diagonal elements of the hopping 
matrix, 

t£ A = tg B = t nnn (e ik ai + e- ik - ai + e ik - aa + e" ik a2 + e lk ' a3 + e" ik ' as ) 



— 2t nnn ^ ] cos(k • aj) — t r 



7k 



3). 



and one obtains, thus, the secular equation 



det 



*k - e k 



(t - se k ) 7k 
with the two solutions (A = ±) 



(t - se k )7 k 
£ k _ e k 



eu = 



t&* + At| 7k | 



(2.20) 



(2.21) 



1 + As|7 k | 

This expression may be expanded under the reasonable assumptions s < 1 
and t nnn <^ t, which we further justify at the end of the paragraph, 



e k 



4 A + At| 7k | - st| 7k | 2 = CJ7k| 2 + At| 7k | 



= 2^ nn ^cos(k-ai) + At 



i 



3 + 2 ^ cos(k ■ 



where we have defined the effective nnn hopping amplitude 

t' = t —it 



(2.22) 



(2.23) 



and we have omitted the unimportant constant — 3t nnn in the last equation. 
One, therefore, notices that the overlap corrections simply yield a renor- 
malisation of the nnn hopping amplitudes. The hopping amplitudes may be 
determined by fitting the energy dispersion (2.22) obtained within the tight- 
binding approximation to those calculated numerically in more sophisticated 
band-structure calculations. These yield a value of t ~ — 3 eV for the nn 
hopping amplitude and t' nnn ~ O.lt, which justifies the above-mentioned ex- 
pansion for t' nnn /t ^ 1. Notice that this fitting procedure does not allow for 
a distinction between the "true" nnn hopping amplitude t nnn and the contri- 
bution from the overlap correction —st. We, therefore, omit this distinction 
in the following discussion and omit the prime at the effective nnn hopping 
amplitude, but one should keep in mind that it is an effective parameter with 
a contribution from nn overlap corrections. 
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(b) 



K 



Figure 2.2: Energy dispersion obtained within the tight-binding approximation, for 
tnnn/t = 0.1. One distinguishes the valence (tt) band from the conduction (jr*) band. The 
Fermi level is situated at the points where the w band touches the tt* band, (a) Energy 
dispersion as a function of the wave- vector components k x and k y . (b) Cut throught the 
energy dispersion along characteristic lines (connecting the points K — » Y — ► M — * K. 
The energy is measured in units of t and the wave vectors in units of 1/a. 



Energy dispersion of tt electrons in graphene 

The energy dispersion (2.22) is plotted in Fig. 2.2 for t nnn /t = 0.1. It 
consists of two bands, labeled by the index A = ±, each of which contains 
the same number of states. Because each carbon atom contributes one tt 
electron and each electron may occupy either a spin-up or a spin-down state, 
the lower band with A = — (the tt or valence band) is completely filled and 
that with A = + (the tt* or conduction band) completely empty. The Fermi 
level is, therefore, situated at the points where the tt band touches the tt* 
band. Notice that, if t nnn = 0, the energy dispersion (2.22) is electron-hole 
symmetric, i.e. e£ = — e^ A . This means that nnn hopping and nn overlap 
corrections break the electron-hole symmetry. The points, where the tt band 
touches the tt* band, are called Dirac points, for reasons that are explained 
in the following chapter. They situated at the points \sP where the energy 
dispersion (2.22) is zero, 




0. 



(2.24) 
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Eq. (2.24) is satisfied when 7 k o = 0, i.e. when 
Re7 k c = 1 + cos(k D • a 2 ) + cos(k I) • a 3 ) 



= 1 + cos 



+ cos 



(2.25) 



= 



and, equally, 



Ini7 k D 



sin 



(*? + V3k°) 



+ sin 



0. (2.26) 



The last equation may be satisfied by the choice k® = 0, and Eq. (2.25), 
thus, when 



l+2cos|^lV 



k 



D 



± 



47T 



3^0 



Comparison with Eq. (1.6) shows that there are, thus, two 5 inequivalent 
Dirac points D and D', which are situated at the points K and K', respec- 
tively, 

k D = ±K = ±^^e x . (2.27) 
3y3a 

Although situated at the same position in the first BZ, it is useful to make a 
clear conceptual distinction between the Dirac points, which are defined as 
the points where the two bands 7r and it* touch each other, and the purely 
crystallographic points K and K' ', which are defined as the corners of the first 
BZ. There are, indeed, situations where the Dirac points move away from the 
points K and K', e.g. when the nn hopping amplitudes are no longer the 
same in the directions 6i, 82, and £3 [19]. In the following chapters, we will, 
however, consider the natural situation for graphene where the Dirac points 
are situated at the BZ corners and use the notation K and K' for the Dirac 
points. 

Notice that because of the symmetry e_ k = e k , which is a consequence of 
time-reversal symmetry, Dirac points occur necessarily in pairs - if k D is a 



5 We remind the reader that there are only two inequivalent points, and not six as 
Fig. 2.2 (b) might suggest. As pointed out in Sec. 1.2.1, there are pairs of three points 
that may be connected to one another by a reciprocal lattice vector and that are, thus, 
crystallographically equivalent. 
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solution of e k = 0, so is — k . In graphene, there is one pair of Dirac points, 
and the zero-energy states are, therefore, doubly degenerate. One speaks 
of a twofold valley degeneracy, which survives when we consider low-energy 
electronic excitations that are restricted to the vicinity of the Dirac points, 
as is discussed in section 2.2. 



Effective tight-binding Hamiltonian 

Before considering the low-energy excitations and the continuum limit, it is 
useful to define an effective tight-binding Hamiltonian, 



This Hamiltonian effectively omits the problem of non-orthogonality of the 
wavefunctions by a simple renormalisation of the nnn hopping amplitude, as 
alluded to above. The eigenstates of the effective Hamiltonian (2.28) are the 
spinors 



the components of which are the probability amplitudes of the Bloch wave- 
function (2.5) on the two different sublattices A and B. They may be deter- 
mined by considering the eigenvalue equation H)s.(t nnn = O)^ = At^kl^k, 
which does not take into account the nnn hopping correction. Indeed, these 
eigenstates are also those of the Hamiltonian with t nnn ^ because the nnn 
term is proportional to the one-matrix 1. The solution of the eigenvalue 
equation (2.29) yields 




(2.28) 



Here, 1 represents the 2x2 one-matrix 





(2.29) 
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and, thus, the eigenstates 6 




( 



1 



(2.30) 



where we have defined the angle 



</?k = arctan 



Im7 k 
Re7 k 



(2.31) 



As one may have expected, the spinor represents an equal probability to 
find an electron in the state on the A as on the B sublattice because both 
sublattices are built from carbon atoms with the same onsite energy e^. 7 



In order to describe the low-energy excitations, i.e. electronic excitations the 
characteristic energy of which is much smaller than the band width ~ \t\, 
one may concentrate on excitations at the Fermi level. This amounts to 
restricting the excitations to quantum states in the vicinity of the Dirac 
points, and one may expand the energy dispersion around ±K. The wave 

6 The eigenstates are defined up to a global (but k-dependent) phase, and one may also 
choose 



which is also found in the litterature. 

7 This is not the case for other lattices such as 2D boron nitride (BN), which also forms 
a hexagonal lattice. However, in the case of BN, one sublattice consists of boron atoms 
with an atomic (onsite) energy e A and the other one of nitrogen atoms with an onsite 
energy e B ^ e A . The difference in the onsite energy /i = e A — e B may be accounted for in 
the effective Hamiltonian (2.28) if one adds a term 



2.2 Continuum Limit 





This term opens a gap in the energy dispersion at the points K and K', 




and it is energetically favourable to fill preferentially the sublattice with lower onsite 
energy. 
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vector is, thus, decomposed as k = ±K + q, where |q| <C |K| ~ 1/a. The 
small parameter, which governs the expansion of the energy dispersion, is, 
therefore, |q|a <C 1. 

It is evident from the form of the energy dispersion (2.22) and the effective 
Hamiltonian that the basic entity to be expanded is the sum of the phase 
factors 7 k . We need to distinguish the sum at the K point from that at the 
K' point, 



it 



7k=±K+q 



1 _|_ e ±*K-a 2e iq-a 2 _|_ e ±iK-a 3e iq-a 3 



1 + e 

+e Ti2vr/3 
±(0) 



1 + iq- a 2 - -(q- a 2 ) 



1 -Hq • a 3 - -(q • a 3 ) 



7 



+ 7q ±(1) +7q ±(2) 



By definition of the Dirac points and their position at the BZ corners K and 

±r 

|q|a. 



K\ we have 7q ( ^ ' ) = 7±k = 0. We limit the expansion to second order in 



First order in |q|a 

The first order term is given by 



~±(i) - r 

7q - 1 



, } L (& + VSqy^/* + (-q x + V3q y )e^ 3 \ 
T ^(q x ±iq y ), (2.32) 



which is obtained with the help of sin(±27r/3) = ±\/3/2 and cos(±27r/3) = 
— 1/2. This yields the effective low-energy Hamiltonian 

Hf* = thv F (q x o* + Zqy<T v ), (2.33) 

where we have defined the Fermi velocity 8 

3ta 3\t\a 

v ^-2H=it (2 - 34) 



8 The minus sign in the definition is added to render the Fermi velocity positive because 
the hopping parameter t ~ — 3eV happens to be negative, as mentioned in the last section. 
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and the Pauli matrices 



( 



1 

1 



) 



and 



( 



-i 

1 



) 



Furthermore, we have introduced the valley isospin £ = ±, where £ = + 
denotes the K point at +K and £ = — the K' point at — K. The low-energy 
Hamiltonian (2.33) does not take into account nnn hopping corrections, which 
are proportional to |7k| 2 and, thus, occur only in the second-order expansion 
of the energy dispersion [at order (|q|a) 2 ]. The energy dispersion (2.22), 
therefore, reads 



independent of the the valley isospin £. We have already alluded to this 
twofold valley degeneracy in Sec. 2.1.3, in the framework of the discussion 
of the zero-energy states at the BZ corners. The twofold valley degeneracy, 
thus, survives when considering the low-energy excitations in the vicinity of 
the Dirac points. From Eq. (2.35) it is apparent that the continuum limit 
|q|a <C 1 coincides with the limit |e| <C \t\, as described above, because 
|e q | = 3£a|q|/2 < \t\. 

It is convenient to invert the spinor components at the K' point (for 



q,£=± - 



(2.35) 




i.e. to invert the role of the two sublattices. In this case, the effective low- 
energy Hamiltonian may be represented as 



Hf* = £hv F (q x cr x + q y a y ) = kv F r z ® q • a, (2.36) 



where we have introduced the four-spinor representation 



\ <- ) 



in the last line via the 4x4 matrices 
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and <r = (<j x ,a y ). In this four-spinor representation, the first two com- 
ponents represent the lattice components at the K point and the last two 
components those at the K' point. We emphasise that one must clearly dis- 
tinguish both types of isospin: (a) the sublattice isospin is represented by 
the Pauli matrices o\ where "spin up" corresponds to the component on one 
sublattice and "spin down" to that on the other one. A rotation within the 
SU(2) sublattice isospin space yields the band indices A = ±, and the band 
index (or band isospin) is, thus, intimitely related to the sublattice isospin. 
(b) The valley isospin, which is described by a second set of Pauli matrices 
r J , the ^-component of which appears in the Hamiltonian (2.36), is due to 
the twofold valley degeneracy and is only indirectly related to the presence 
of two sublattices. 

The eigenstates of the Hamiltonian (2.36) are the four-spinors 



*r + = -4 



M y/2 

where we have, now, 



/ 1 \ 

Ae^ 



V o j 



and ¥~ 



1 

7! 



/ \ 


1 

\ -Ae^ J 



(2.37) 



ip Q = arctan I — 



(2.38) 



Density of states 

The electronic density of states p(e) counts the number of quantum states in 
the vicinity of a fixed energy e. It turns out to be a very useful quantity when 
discussing the electric transport in the diffusive regime. It may be obtained 
from the total number of states below the energy e, 

N x=+ = g Yl =A f e de'p(e'), (2.39) 

where we have chosen the electronic n* band, by derivation with respect to 
the energy 

1 dN+ 

*) = 3-5-, (2.40) 



Continuum Limit 



37 



in terms of the total surface A 9 The factor g takes into account the de- 
generacy due to internal degrees of freedom - g = 4 in the case of graphene 
because of the twofold valley degeneracy and the electron spin. 

The calculation of the density of states may be rather involved because 
one needs to calculate a 2D integral over the wave vector as a function of 
energy, by inverting the energy dispersion. In the vicinity of the Dirac points, 
however, it may be calculated quite easily because of the isotropy of the 
energy dispersion (2.35). One may identify 

£ iqq ' 

k/e+<e 

where we have profited from the isotropy of the energy dispersion in the 
vicinity of the Dirac point, with the right hand side of Eq. (2.39), and one 
obtains 

By inverting the energy dispersion (2.35), one obtains 



P(e > 0) 



2nh 2 v 2 F ' 



10 



and for the full (positive and negative) energy range 

*> - A F • (2 - 42) 

as long as |e| <C \t\. 

The density of states, therefore, vanishes linearly at the Dirac points, 
at zero energy. This is a direct consequence of the linearity of the energy 
dispersion in the vicinity of the Dirac points. This particular situation needs 
to be contrasted to the conventional case of electrons in 2D metals, with an 
energy dispersion of e = h 2 q 2 /2m*, in terms of the band mass to*, where one 
obtains a constant density of states, p 2D {t) = gm* /2irh 2 . 

The full density of states, in the absence of nnn hopping corrections, has 
been calculated by Hobson and Nierenberger in 1953 [20]. It is schematically 



9 The density of states denned in this manner is indeed the density of states per unit 
surface, or unit volume in 3D. 

10 Thc negative branch of the density of states may be calculated in the same manner. 
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1 Energy 



Figure 2.3: Schematic plot of the density of states for electrons in graphene in the 
absence of nnn hopping. The dashed line indicates the density of states obtained at linear 
order in the continuum limit. 

shown in Fig 2.3. In agreement with the result (2.42) obtained at linear order 
in the continuum limit, it vanishes linearly at zero energy. The divergencies at 
±t, called van-Hove singularities, are due to the saddle points of the energy 
dispersion at the M points at the borders of the first BZ (see Fig. 2.2). 
This divergency may be understood qualitatively from Eq. (2.42) - roughly 
speaking, the derivative of the energy dispersion occurs in the denominator 
of the expression for the density of states and, thus, causes a divergency 
whenever it becomes zero, such as at the extrema or saddle points of the 
energy dispersion. 

Second order in |q|a 

Although most of the fundamental properties of graphene are captured within 
the effective model obtained at first order in the expansion of the energy 
dispersion, it is useful to take into account second-order terms. These cor- 
rections include nnn hopping corrections and off-diagonal second-order con- 
tributions from the expansion of 7 k . The latter yield the so-called trigonal 
warping, which consist of an anisotropy of the energy dispersion around the 
Dirac points. 

The diagonal second-order correction, which stems from the nnn hopping, 
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is readily obtained from Eq. (2.32), 



9a 2 



T-(t =t K^l 2 l~f K?( 1 )| 2 i = t I o I 2 1 

1 L nnn b nnn\ /q] — *>nnn\ /q I , ^nnn\H\ 



(2.43) 



The off-diagonal second-order corrections are somewhat more involved 
because we need to calculate the modulus of 7q , 



I !q\ ~ 

Because of 



07f ) | 2 + 2Re 7q «* 7 f)] 1/2 ^| 7 f)| 



1 + 



Re 7q (1 N q (2) 
l7 q (1) | 2 



Qn 2 

Re 7q a)* 7 f) = - H) > 

one obtains at second order 



i e\ 3a 
l7ql - irH 



1 - COs(3^ q ) 



where we have used the parametrisation q x = |q| cosy3 q and q y = |q| sin</? q . 
Finally, the energy dispersion (2.22) expanded to second order in |q|a reads 



9a' 



tnnnH 2 + \hv F \q\ 



q a 



1 -£^cos(3y? q ) 



(2.44) 



As we have already mentioned in Sec. 2.1.3, it is apparent from Eq. 
(2.44) that the nnn correction (which also accounts for nn overlap correc- 
tions) breaks the electron-hole symmetry e q ^ = — e q ^. This correction is, 
however, a rather small correction, of order \c[\at nnn /t, to the first-order ef- 
fective Hamiltonian (2.36). The second-order expansion of the phase factor 
sum 7q yields a more relevant correction - the third term in Eq. (2.44), of 
order |q|a \q\at nnn /t - to the linear theory. It depends explicitely on the 
valley isospin £ and renders the energy dispersion anisotropic in q around 
the K and K' point. The tripling of the period, due to the term cos(3<^ q ), 
is a consequence of the symmetry of the underlying lattice and is also called 
trigonal warping, as mentioned above. 

The trigonal warping of the dispersion relation is visualised in Fig. 2.4, 
where we have plotted the contours of constant (positive) energy in Fourier 
space. The closed energy contours around the K and K' points at low energy 
are sparated by the high-energy contours around the Y point by the dashed 
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Figure 2.4: Contours of constant (positive) energy in the wave- vector space, (a) Contours 
obtained from the full dispersion relation (2.22). The dashed line corresponds to the energy 
t+t nnn , which separates closed orbits around the K and K' points (black lines, with energy 
e < t + t nnn ) from those around the T point (gray line, with energy e < t + t nnn ). (b) 
Comparison of the contours at energy e — 1 cV, 1.5 eV, and 2 eV around the K' point. The 
black lines correspond to the energies calculated from the full dispersion relation (2.22) 
and the gray ones to those calculated to second order within the continuum limit (2.44). 
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Figure 2.5: Schematic view of an ARPES measurement (from Ref. [21]) The analyser 
detects the energy Ef of the photoemitted electron as a function of the angles i? and ip, 
which are related to the momentum of the electronic state. 

lines in Fig. 2.4 (a) at energy |t+t nn n| the crossing points of which correspond 
to the M points. As mentioned above, the dispersion relation has saddle 
points at these points at the border of the first BZ, which yield the van- 
Hove singularities in the density of states (see Fig. 2.3). In Fig. 2.4 (b), 
we compare constant-energy contours of the full dispersion relation to those 
obtained from Eq. (2.44) calculated within a second-order expansion. The 
contours are indistinguishable for an energy ofe= \t\/3 ~ 1 eV, and the 
continuum limit yields rather accourate results up to energies as large as 2 
eV. Notice that, in today's exfoliated graphene samples on a SiC>2 substrates, 
one may probe, by field-effect doping of the graphene sheet, energies which 
are on the order of 100 meV. Above these energies the condenser breaks down, 
and Fig. 2.4 (a) indicates that the continuum limit (2.44) yields extremely 
accurate results at these energies. 



2.3 Experimental Characterisation of the Elec- 
tronic Band Structure 

The energy dispersion relation of solids may be determined by ARPES (an- 
gle resolved photoemission spectroscopy). A typical ARPES measurement 
consists of the detection of the direction and energy of a photoemitted elec- 
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Figure 2.6: ARPES measurement of the dispersion relation of epitaxial graphene (from 
Ref. [22]). 



tron, i.e. an electron emitted from the solid to be studied after prior optical 
excitation by a photon of well-defined energy and momentum, Uv and Kk, 
respectively (see Fig. 2.5). The technique, which measures the spectral func- 
tion, the imaginary part of the electronic Green's function 1/ (u — ek x + in), is 
based on energy and momentum conservation in a electron-photon scattering 
process, 

%v = Ef — Ei, k = kj — kj, 

where Ei and kj represent the energy and momentum, respectively, of the 
initial electronic state and Ef and kj those of the final electronic state, i.e. 
the state of the photoemitted and detected electron. The direction of the 
latter, in terms of the angles d and (f shown in Fig. 2.5, yields precisely the 
momentum-dependence of the energy and thus the dispersion relation. 

Fig. 2.6 shows ARPES results obtained by Bostwick et al. on epitaxial 
graphene [22]. Similar results have been obtained on thin graphitic samples 
[23]. The studied sample has been obtained by graphitisation of the Si- 
terminated (0001) surface. Fig. 2.6 (a) shows the energy dispersion of the 
7r band and yields a good agreement with that calculated in Sec. 2.1.3 [see 
Fig. 2.2 (b)]. In order to obtain an ARPES signal, the initial state must be 
occupied, and one may see from Fig. 2.6 (a) that the Fermi level is in the tt* 
band at roughly Ep ~ 0.45 eV. This is in agreement with the statement in 
Sec. 1.3.2 that the graphene layer on SiC is naturally electron-doped due to 
charge transfer from the SiC substrate. Fig. 2.6 (b) shows the form of the 
first BZ, which is obtained by an energy cut through the dispersion relation 
at zero energy (at the Dirac points). The energy cut at the Fermi level, 
Ep ~ 0.45 eV, is shown in Fig. 2.6 (b) and reveals the circular Fermi surface 
around the K and K' points. The faint satellites are due to the underlying 
(hexagonal) structure of the SiC substrate. 



Continuum Limit 



43 



Finally, Fig. 2.6 (d) corresponds to a cut at —1.0 eV, i.e. in the ir band at 
an energy rather remote from the Dirac points. The abovementioned trigonal 
warping of the Fermi surface is clearly visible at this energy. 
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Chapter 3 



The Dirac Equation for 
Relativistic Fermions 

As we have discussed in the last chapter, the low-energy properties of elec- 
trons in graphene are described by the Hamiltonian (2.36), 

which happens to be of the form of the Dirac equation describing relativistic 
massless fermions. We, therefore, devote this chapter to a brief introduction 
to relativistic quantum mechanics with a particular emphasis on two spatial 
dimensions, which are of interest in the context of graphene. This chapter 
is not meant to be a complete review of relativistic quantum mechanics of 
quantum field theory, and we refer the interested reader to more specialised 
books on this fascinating field of physics [24, 25, 26]. We have, furthermore, 
tried to be as concrete as possible and to avoid formal aspects related to the 
Lorentz invariance of relativistic space-time, which may be useful for high- 
energy physicists but which are less relevant for condensed-matter physicists. 

After retracing the historical path, which led to the elaboration of rel- 
ativistic quantum mechanics and quantum field theory in the late 20ies of 
the last century, we concentrate on the particular form of the Dirac equation 
in 2D (i.e. two space and one time dimensions) and its amazing physical 
consequences such as the localisation problem, the resulting jittery motion 
("Zitterbewegung"), and Klein tunneling. 
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3.1 Relativistic Wave Equations 

In order to obtain a relativistic wave equation, one may proceed in the same 
manner as in the derivation of the Schrodinger equation for non-relativistic 
particles from classical Hamiltonian mechanics. Remember that the classical 
Hamiltonian of a particle in a potential V(r) is a function 

H(p,r) = £ + V(r), (3.1) 

defined in phase space, the space spanned by the particle's position r and 
its canonical momentum p. In quantum mechanics, the position and the 
momentum no longer commute, and one reinterprets Eq. (3.1) in terms of 
operators acting on vectors of a Hilbert space, when one replaces 

H -> ihd t and p -> -mV , (3.2) 

where V is the gradient with respect to the position r, and dt is the time 
derivative. This procedure yields the well-known Schrodinger equation 



ihd t ip(r,t) 



- A v , + v(r) 



V>(r,t) (3.3) 



of basic quantum mechanics, where the wave function ip(r,t) is the spatial 
representation of the Hilbert vector. 

One may indeed apply this procedure to a free relativistic particle of mass 
m the energy dispersion of which is given by 



E = ^m 2 c A + p 2 c 2 , (3.4) 

in terms of the velocity of light c. Whereas the taylor expansion for |p| <C 
mc yields the non-relativistic dispersion mc 2 + p 2 /2m, apart from the mass 
energy mc 2 , the ultra-relativistic limit of vanishing mass yields E = c|p|. 
This indicates that we need to treat, in relativistic (quantum) mechanics 
the energy E and the momentum p on the same footing, in contrast to 
the non-relativistic limit, where the energy is proportional to the square of 
the momentum. In relativistic mechanics, one treats indeed the energy, due 
to the Lorentz invariance, as the time-component of a D + 1 dimensional 
momentum vector, p M = (E/c,p), or its "covariant" vector = (E/c, — p), 
to account for the particular metric of the D+l- dimensional space-time. The 
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k 



Figure 3.1: Dispersion relation of a relativistic particle. The dispersion consists of two 
energy (positive and negative) branches, which are separated by twice the particle's mass 
energy mc 2 . 

energy dispersion is obtained from the Lorentz-invariant norm of this vector, 
= E 2 /c 2 — p 2 = m 2 c 2 , given in terms of the mass of the particle, 
which is an invariant scalar with respect to the Lorentz transformations in 
space-time. A slightly more detailed discussion of these aspects, concerning 
the covariance of special relativity, may be found in Sec. 3.2.2. 

3.1.1 Relativistic Schrodinger/Klein-Gordon equation 

Evidently, it is more useful, in order to avoid complicated functions of differ- 
ential operators, to introduce the "quantum" substitution (3.2) not directly 
in Eq. (3.4), but in its square. This substitution yields the equation 

-h 2 dfy(r, t) = (mV - k 2 c 2 V 2 ) ^(r, t), (3.5) 

which has been derived independently by Schrodinger, Klein and Gordon 
in 1926/27 and which is called relativistic Schrodinger or else Klein-Gordon 
equation. The latter name is more common, and we follow this convention 
here. 

The Klein-Gordon equation raises, however, more puzzling questions for 
the physicists of that time than it was supposed to solve. A first consequence 
of the Klein-Gordon equation is that it possesses solutions of negative energy, 
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as may be seen from the plane-wave ansatz 



u e 



-i(Et/h-k-r) 



which, substituted in Eq. (3.5), yields E 2 u = (m 2 c 4 + /i 2 c 2 k 2 )-u and, thus, 
the two solutions 

E ± 



± 



m 



2 c 4 + h 2 c 2 k 2 , 



(3.6) 

instead of a single positive one. Both solutions are plotted in Fig. 3.1. Notice 
that negative-energy states are not problematic in a classical theory because 
of the gap 2mc 2 between the two energy branches - negative and positive 
energy particles would simply not communicate. However, such states are a 
severe problem for a quantum theory, where the ground state, alias the lowest 
energy state, may not be correctly defined for an energy spectrum, which has 
no lower bound. A particle may, in principle, "fall" from a positive-energy 
state to one of negative energy by emitting, e.g., a photon the energy of 
which corresponds to the energy difference between the initial and the final 
states. We will discuss this problem in more detail in Sec. 3.3. 

Another problem of the Klein-Gordon equation (3.5) is related to its 
second-order time derivative. One would, in principle, need to specify not 
only if)(r,t = to), i.e. the wave function at some reference time to, but also 
its time derivative. As a consequence, the probability density p(r,t), which 
appears in the continuity equation 



^p(r,t) + V-j(r,t) = 0, 



(3.7) 



is no longer necessarily positive, in contradiction to a physicists intuition 
of a probability density. In order to illustrate this point, we consider first 
the non-relativistic Schrodinger equation (3.3). If we multiply Eq. (3.3) by 
ip(r,t)* and substract the result from the complex conjugate of Eq. (3.3) 
times ip(r,t), we obtain 

ih[^{v,tyd t ^{v,t) + lf>(T,t)dtlf>(T,t)*] 

= ~ [iKr, t)*VV(r, t) - V(r, t) V 2 ^(r, t)*} 



or else 

4Wt,t)V(r,t)]+V 



«r,t)-^V«(r,t)-*r,f)JLv,Kr,t)' 



= . 

(3.8) 
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Comparison with the continuity equation (3.7) yields the probability density 

P (T,t)=l/>(T,tyif>(T,t), (3.9) 

which corresponds to the probability of finding the quantum particle at the 
position r at time t, and the probability current density 

j(r,t) = ^[^(r,t)*V^(r,t)-^(r,t)V^(r,t)*] . (3.10) 

If we proceed in the same manner for a quantum particle described in 
terms of the Klein-Gordon equation (3.5), one obtains the equation 

[^(r,t)*^(r,t) - TP(r,t)dfy(r,t)*} 

- c 2 ty(r, t)*V 2 ^(r, t) - V>(r, t) V 2 ^(r, f)*] = 0, 

and the corresponding probability and probability current densities are, thus, 
after multiplication of the above equation by ih/2mc 2 , 

p(r, t) = — [V>(r, tyd^(r, t) - ^(r, t)M(r, *)*] (3.11) 

and 

j(r,t) = ^[^(r,t)*V^(r,t)-^(r,t)V^(r,t)*] , (3.12) 

respectively. Whereas Eq. (3.12) coincides with the probability current den- 
sity (3.10) of the Schrodinger case, the probability density (3.11) is clearly 
different from Eq. (3.9). In contrast to our intuition and non-relativistic 
quantum mechanics, this probability density may become negative, and it 
is, therefore, no longer possible to interpret it as the probability of finding a 
(relativistic) quantum particle at the position r at time t. 



3.1.2 Dirac equation 

The above-mentioned problems of the Klein-Gordon equation led Dirac, in 
1928, to propose an alternative relativistic wave equation [27]. Whereas its 
solutions yield positive probability densities, which are more straight-forward 
to interpret physically, it did not solve the problem of negative energy states. 
The latter have finally found a compelling interpretation by Dirac in terms 
of anti-particles, as we will discuss later in this chapter. 
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In order to avoid negative probability densities, Dirac understood that 
one had to search for an equation that is linear in E, i.e. of first order in 
the time derivative dt- For dimensional reasons, the only possible ansatz for 
such an equation is 

H = (3mc 4~ col ■ p — > %h [d t + cot ■ V] i/>(t, t) = /WV(r, *), (3.13) 

where (3 and 1 ex. = (a±, an) are dimensionless objects the mathemati- 
cal properties of which are fixed by the requirement that the square of the 
operator equation (3.13) must satisfy the relativistic dispersion relation 

E 2 = m 2 c 4 + c 2 p 2 = ftm 2 ^ + c 2 (a • p) 2 + mc 3 ([3ct p + a p(3) . 

In order to satisfy this equation, the introduced mathematical objects must 
anti-commute, 

(3 2 = 1, { ai ,f3} = 0, and {a i ,a,} = 2%, (3.14) 

in terms of the anti-commutator {A, B} = AB + BA. Eq. (3.14) defines 
the so-called Clifford algebra, and one immediately realises that the objects 
(3 and aij must be N x N matrices, with N > 1. We summarise their main 
properties. 

(i) Because the Hamiltonian (3.13) is Hermitian, the matrices must be 
Hermitian, 

cti = ot\ and (3 = ft . 

(ii) Furthermore, because of a 2 = (3 2 = 1 [see Eq. (3.14)], the matrices 
must be unitary 

ai = a\ = a' 1 , and (3 = ft = 

and their eigenvalues are, therefore, ±1. 

(iii) If one applies the determinant to the anti-commutator, 

one obtains with the help of the elementary properties of the determi- 
nant, 2 

det^Oj) = det(— ajCti) = (— 1) N det (a j(Xi) = (— 1) N detfaiOtj). 

In order to satisfy this equation, one needs to choose iV to be even. 
The lowest possible value is, therefore, N = 2. 

1 One needs to introduce one on for each space dimension. 

2 We use the fact that if one multiplies an TV x N matrix A by a scalar A, one has 
det(XA) = X N det(A), and that the determinant of a product of two matrices A and B is 
the product of their determinants, det(AB) = det(A) det(B). 
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Influence of the space dimension on the matrix dimension N 

Notice that the dimension iV of the matrix must not be confunded with the 
space dimension D. The latter influences, however, the possible choices for 
N. 

In the case of two space dimensions (D = 2), which is of interest in 
graphene, we have three mutually anti-commuting objects, f3, ai, and 0:2- It 
happens that there exist, with the Pauli matrices, 

-(!!)•"(? )••■-(; A). <-» 

a set of three mutually 2x2 matrices, which satisfy the Clifford algebra 
(3.14). One may identify 3 f3 = <j z , ol\ = & x , and a 2 = cr y , which yields the 
2D Dirac Hamiltonian 

H 2D = c<x-p + mcV, (3.16) 

where a = (a x , a y ), as in the previous chapter. If we identify c with the Fermi 
velocity vf, one immediately sees that this Hamiltonian, for m — 0, has the 
same form as the effective Hamiltonian (2.36) derived in the last chapter for 
the description of the low-energy properties of electrons in graphene, at the 
K' point (£ = 1). The effective Hamiltonian at the K point (£ = —1) may 
be identified with a representation of the Dirac Hamiltonian in terms of the 
matrices (5 = —cr z , a\ = —cr x , and a 2 = —& y , which are an equally valid 
choice for representing the Clifford algebra (3.14). 

For three space dimensions (D — 3), the representation of the matrices 
f3 and ai is slightly more involved. Whereas one may choose again the 2x2 
Pauli matrices for zero-mass particles, i.e. when one may omit the matrix j3, 

H%4 = ca-p, (3.17) 

with e.g. ai = cf x ', 012 = & v , and 03 = a z , this is no longer possible for 
massive 3D particles, as may be seen from the following argument. Let 
us assume that the Pauli matrices represent a±, a 2 , and /3, respectively. 
One must, therefore, find a 2 x 2 matrix, a 3 , which anti-commutes with 
the Pauli matrices. However, the latter also form, together with the 2x2 
unit matrix 1, a basis of all 2 x 2 matrices. One may, thus, decompose 



3 Notice that all other permutations are equally possible. 



52 



The Dirac Equation for Relativistic Fermions 



a 3 = a t + d\o x + a 2 <J y + a s a z , in terms of the real numbers a , ai, a 2 , and 
a 3 . The anti-commutation requirement {0:3, a % } = 2(a + a^) = is satisfied 
only if we choose Oj = — ao, for all values of %. However, this is at odds with 
the condition a\ = 1 because one would have a§ = 2a (2 — a x — a y — a z ). It 
is, therefore, impossible to represent the matrices (5 and at by 2 x 2 matrices. 

Due to the condition that N must be even, the lowest possible value for 
representing massive 3D particles is N = 4, and indeed one may satisfy the 
Clifford algebra (3.14) if one chooses, e.g., the so-called standard representa- 
tion 

P = ( J -1 ) and Qi = ( a* ) ' ^ 3,18 ' ) 

where each component is a 2 x 2 matrix, as may be verified in a straight- 
forward manner. As a consequence, the Dirac Hamiltonian has also a 4 x 4 
matrix structure, 

Hl%= ( cp.o- _^c 2 l ) ' (3 ' 19) 
and the quantum states must be represented by 4-spinors, 

* = 

in terms of the 2-spinors 

- d *-(£)• 

In order to obtain a better understanding of this spinor form, we may inves- 
tigate the most simple case of a particle in its rest frame, i.e. with p = 0, in 
which case the Hamiltonian (3.19) becomes diagonal. The 2-spinors <j) and x 
are, therefore, decoupled, 

mc 2 (f) = E<f> and — mc 2 x = E\- 

One immediately notices that the components correspond to components 
of positive energy (particle components) and x to those of negative energy 
(anti-particle components). One may furthermore show that the components 
0i, 02 and X\i X2 of the 2-spinors and Xi respectively, correspond to the 
different possible spin states of the particle [24, 25]. 
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Continuity equation 

We finally notice that the Dirac equation yields positive probability densities, 
which were Dirac's main motivation to propose an alternative to the Klein- 
Gordon equation, as mentioned above. In order to obtain the continuity 
equation (3.7), we proceed in the same manner as for the Schrodinger or 
the Klein-Gordon equation, i.e. we multiply the differential equation (3.13) 
from the left by the spinor ij;'(r,t) and subtract the Hermitian conjugate of 
Eq. (3.13), multiplied from the right by ip(r,t). This yields the continuity 
equation 

d t [^( r ,t)^(r,t)] +cV- [^(r,t)a^(r,t)] , 
and one thus obtains the positive probability density 

P (T,t)=^(T,t)lj>(T,t) 

and for the probability current density 

j(r,t) = c^(r,t)aip(r,t). 

3.2 The 2D Dirac Equation 

After this short excursion to the 3D Dirac equation, we now concentrate on 
the 2D equation (3.16), 

H 2D = p-(T + ma z , 

which interests us in the case of graphene, where we choose a unit system 
with c = h — 1, for the remainder of this section. We may rewrite the 
Hamiltonian as 

2D _ J cos 13 sin/3e-^p \ ( . 

H ~ e ^sin/3e^p -cos/3 )> [6 ' Zi) 

where we have defined e = y/ 'm 2 + p 2 , cos/3 = m/e, sin (3 = |p|/e, and 
ip p = arctan(p y j p x ) , as in Eq. (2.38) for the wave vector q, which is replaced 
by the momentum p. The eigenvalues of the unitary and Hermitian matrix 
on the r.h.s. of Eq. (3.21) are A = ± and correspond to the positve and 
negative energy states, respectively, and we have 

E x = Xe = Av^ + P 2 , (3.22) 

in agreement with Eq. (3.6). 
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3.2.1 Eigenstates of the 2D Dirac Hamiltonian 



In order to obtain the eigenstates, we make the plane-wave ansatz for a 
2-spinor, 



*A, P (r) 

and one obtains from its substitution in Eq. (3.21) 



J_ ( u x 



(3.23) 



(cos (3 - X)u x + sin [3e~ ilpp vx = 0. 

A possible choice for the 2-spinor components, which respects the normali- 
sation, is 

u\ = a/1 + Acos/9 and v\ = Aa/1 — Acos/9e^ p , 
which yields for the eigenstates 

3 ipr . (3.24) 



*A, P (r) = 



V2 



\ 



XJ1- 



One notices that the ultra-relativistic limit (m 

states, 

l/i \ . 

,ipr 



1 

\ e i<Pp 



0) yields the same 



(3.25) 



as those of low-energy electrons in graphene (2.37) if we consider one Dirac 
point K or K'. These low-energy electrons may, therefore, indeed be viewed 
as massless 2D relativistic particles. In the opposite limit of large mass 
(m 3> |p|), one obtains, as in the 3D case, two decoupled equations for the 
components, with 



A=+,pl 



*A=-, P (r) 



ip-r 



apart from an unimportant global phase factor in the second line. In the 
case of small, but non-zero, values of the momentum, one obtains a small ad- 
mixture by the "small" components V\=+ and U\=~- In relativistic quantum 
mechanics, the components u\=+ and v\=- are sometimes also called "large" 
components. 
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*'G = -) 



momentum 



Figure 3.2: Relation between band index A, valley isospin £, and chirality -q in graphene. 

3.2.2 Symmetries and Lorentz transformations 
Chirality for zero-mass particles 

In high-energy physics, one defines the helicity of a particle as the projection 
of its spin onto the direction of propagation, 



which is a Hermitian and unitary operator with the eigenvalues r] — ±, 



Notice that <r describes, in this case, the true physical spin of the particle. In 
the absence of a mass term, the helicity operator commutes with the Dirac 
Hamiltonian, and the helicity is, therefore, a good quantum number, e.g. 
in the description of neutrinos, which have approximately zero mass. One 
finds indeed, in nature, that all neutrinos are "left-handed" (77 = — ), i.e. 
their spin is antiparallel to their momentum, whereas all anti-neutrinos are 
"right-handed" (rj = +). 

For the case of graphene, described in terms of the 2D Dirac equation 
(3.13), one may use the same definition (3.26) for a Hermitian and unitary 
operator h p . Here, the Pauli matrices no longer define the true spin, but the 
sublattice isospin, as described in Sec. 2.2. Because of this difference, one 
also calls h p the chirality operator. It clearly commutes with the massless 




p ■ cr 



(3.26) 



P 




(3.27) 



56 



The Dirac Equation for Relativistic Fermions 



2D Dirac Hamiltonian (3.16), and one may even express it in terms of the 
chirality operator 

H 2D = \p\h p , and H p >=£H 2D = Z\p\h p , 

where the second expression takes into account the two-fold valley degener- 
acy, in terms of the valley isospin £ = ±. The band index A, which describes 
the valence and the conduction band, is, therefore, entirely determined by 
the chirality and the valley isospin, and one finds 

A = & , (3-28) 

which is depicted in Fig. 3.2. 

Lorentz transformations of scalars and vectors 

Although the covariance of the Dirac equation and Lorentz transformations 
in general are less relevant for the condensed-matter physicist working on 
graphene, we intend to give a very brief overview on these slightly more 
formal aspects. They allow us to define a parity and a time-reversal operator, 
which are the special forms of Lorentz transformations useful in the context 
of graphene. 

In special relativity, all physical laws must be invariant under the Lorentz 
transformations (e.g. rotations and Lorentz "boosts") of the D + 1 dimen- 
sional space time. This the meaning of the statement that all physical equa- 
tions must be covariant. As for the more common D-dimensional space, one 
may distinguish different mathematical objects in this space with respect to 
their behaviour under a space symmetry transformation 4 TZ: a scalar remains 
invariant under this transformation, 

TZ : s — > s' = s , 

whereas a vector is transformed by the particular law 

H\ v -> v' = Rv, 

where R is the rotation matrix associated with TZ. One may, furthermore 
define a tensor (of rank r), all the r components of which are transformed 
by the rotation matrix 

4 We consider, for simplicity a rotation in D = 3, which is a symmetry operation for an 
isotropic space. 
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We may proceed in the same manner in the case of the D + l dimensional 
space-time, where a scalar is invariant under a Lorentz transformation C, 

C : s — > s' = s, 

whereas aD + 1 vector transforms as 5 

£ : v'"' = A^u", (3.29) 

in terms of the (D + 1) x (D + 1) matrix A. The generalisation to a tensor 
is again straight-forward. In contrast to the above-mentioned D-dimensional 
space, one must clearly distinguish between vectors with an upper index and 
those with a lower one. This is due to the special metric of space-time 



/ 1 
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-1 
















-1 







\o 








-1 


/ 



and the vectors with upper and lower index are related by the metric v M = 
g^ v v u . Thus, if = (i> ,v), t> M = (i>o, — v), where v regroups the D space 
components and v is the time component of the D + l vector. One may, 
furthermore, define the scalar product between two D + l vectors as 

(v,w) = tAiv , 

which must, as the name "scalar" indicates, be invariant under a Lorentz 
transformation. As an example to illustrate these rather formal aspects, we 
consider the position vector = (ct, r) in space-time or the momentum 
vector = {E/c, p), which we have already alluded to at the beginning of 
this chapter. The scalar products yield the mass term, 

u 77i2 / 2 2 2 2 

p^p^ = E /c — p = m C : , 
which is indeed a scalar, and the scalar 

x » Xlx = c 2 t 2 - r 2 . (3.30) 

5 We use the Einstein convention, where one sums over repeating (upper and lower) 
indices omitting the sum sign, 

w % = Yl v ^ v ^ ■ 

A* 
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Instead of a classification of all types of Lorentz transformations, which 
may be found in standard text books on relativity and quantum field the- 
ory, e.g. [24, 25], we simply discuss some explicit examples. The simplest 
example is a rotation in the space part, which does not mix time and space 
components. One may indeed use the D-dimensional rotation matrix R, and 
embed it into a (D + 1) x (D + 1) matrix, 



A less trivial example is the so-called Lorentz boost, a transformation into 
a frame of reference that moves at the speed v with respect to the original 
one, in a direction that we may choose to be the x-direction. In this case, we 
need to account for the invariance of the scalar product x^x^ in Eq. (3.30), 
and we need c 2 t 2 — x 2 = c 2 t' 2 — x' 2 , because the y- and ^-components are 
not affected by the transformation. The transformation, which respects this 
metric, may be expressed in terms of hyperbolic functions, 6 



of the "angle" £, which is also called rapidity, may be related to the velocity 
v of the Lorentz boost, by 



which is a manifestation of the time dilatation and the Lorentz contraction 
in a Lorentz boost. In matrix form, we may thus describe the Lorentz boost 
in the x-direction by 




ct' = ct cosh ( — x sinh ( 
x = —ct sinh ( + x cosh ( , 



which respects the metric because of cosh 2 £ — sinh 2 ( = 1. These functions 




and 





/ coshC -sinhC \ 
— sinh ( cosh ( 



(3.31) 



10 

\ 1 / 



6 Rcmembcr that for a metric which leaves x 2 + y 2 invariant, we would have chosen 
trigonometric functions, which define indeed the rotation. 
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However, Lorentz boosts play a subordinate role in the discussion of elec- 
trons in graphene, as compared for example to the parity transformation, 
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(3.32) 



which transforms the space components, r — > — r, but leaves the time com- 
ponent invariant, and the time-reversal transformation 
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(3.33) 



which inverts the direction of time flow, t — > —t, but leaves invariant the 
space components. 



Lorentz transformations of spinors 

In the last paragraph, we have briefly reviewed the covariance of classical 
relativistic mechanics, i.e. discussed how the laws of physics, in terms of 
scalars, vectors, and tensors, behave under Lorentz transformations. If we 
consider the quantum aspect of the laws of physics, we must also analyse 
how the state vectors (spinors) in the Hilbert space of quantum mechanics 
transform in a Lorentz transformation. The spinor in a transformed frame 
of reference must be related to that of the original one, 

V'(r', t') = S(£)i/,(r, t) & V'(r', f ) = S(£)i/, [(A" 1 ) , (3.34) 

where (A -1 )^ is the inverse of the matrix, which corresponds to the Lorentz 
transformation C, and S(C) is an A" x N matrix that acts on the spinor. 

It is, furthermore, convenient to rewrite the Dirac equation (3.13) in a 
covariant manner, 

(zvy"-mc)^(r,f) =0, (3.35) 

where we have defined the 7 matrices 7 = (3 and 7* = f3a l . In order for 
this equation to be Lorentz invariant, i.e. valid in any frame of reference, we 
must have 

(p'rf-mc) V>'(iV) = 0, 
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which yields, with the above definitions, 

[S~\C)^p u ^S{C) - mc] V>(r, t) = 0. (3.36) 
Comparison with Eq. (3.35), thus, imposes 

S-\C)YS(C) = A^y". (3.37) 

Parity and time reversal 

Eq. (3.37) allows us to construct the matrices S(C) for the different Lorentz 
transformations. As already mentioned above, we are not interested, in the 
case of graphene, in those corresponding to Lorentz boosts. Instead, we con- 
centrate on the parity and time-reversal operations for the 2D Dirac equation. 
Notice that, in this case, the 7 matrices read 

7 = o z and Y = o z o\ (3.38) 

for i — x,y. 

For the parity operator, Eqs. (3.33) and (3.37) impose 

S- 1 (P)a z S(P) = o z and S- 1 (P)a z a i S(P) = -a z a\ 
One notices that the choice 7 

S(P) = o z V (3.39) 

satisfies this equation, where V replaces r by — r in the argument of the 
spinor. Notice that the eigenstates of the parity operator are only eigenstates 
of the 2D Dirac Hamiltonian in the particle's rest frame for m 7^ and p = 0. 
In their rest frame, particles and anti-particles have, thus, opposite parity. 

The definition of the time-reversal operator is slightly more involved. Eqs. 
(3.34) and (3.37) impose 

S- 1 (PT)a z S(T) = -a z and S , - 1 (T)<rVS , (T) = a z a\ 

In this case, one needs to accompany the 2x2 transformation matrix a y by 
the conjugation operation C : z — > z*, and one finds 



S(T) = a y CT, 
7 An equally valid choice is a z multiplied by a phase factor exp(i</>). 



(3.40) 
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where T replaces t by —t in the spinor argument, which only plays a role in 
non-stationary spinors. We may investigate the effect of S(T) on the state 
(3.25) ^^ k (r) at the K point in graphene, 

sm*u*) = ^ ( A T ) e " lkr K ^ 

where we have used exp(iy?_k) = exp[i(7r — </?k)] = — exp(iy2k). One, thus, 
notices that the time reversal operation maps a state at the K point at 
wave vector k onto a state at K' at wave vector — k. 8 It is indeed a general 
statement that time reversal yields k — > — k and that time-reversal symmetry 
in a crystal, therefore, leads to a band dispersion with e_k = £k- 

3.3 Physical Consequences of the Dirac Equa- 
tion 

Limits of the validity of the Dirac equation if interpreted as a one-particle 
quantum theory 

3.3.1 Minimal length for the localisation of a relativis- 
tic particle 

3.3.2 Velocity operator and "Zitterbewegung" 

3.3.3 Klein tunneling and the absence of backscatter- 
ing 



8 Notice that comparison with Eq. (2.37) requires the interchange of the sublattice 
components of the spinor at the K' point, as mentioned in Sec. 2.2. 
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